Mini-Minimax Uncertainty Quantification for Emulators∗
نویسندگان
چکیده
Consider approximating a “black box” function f by an emulator f̂ based on n noiseless observations of f . Let w be a point in the domain of f . How big might the error |f̂(w) − f(w)| be? If f could be arbitrarily rough, this error could be arbitrarily large: we need some constraint on f besides the data. Suppose f is Lipschitz with a known constant. We find a lower bound on the number of observations required to ensure that for the best emulator f̂ based on the n data, |f̂(w)− f(w)| ≤ ε. But in general, we will not know whether f is Lipschitz, much less know its Lipschitz constant. Assume optimistically that f is Lipschitz-continuous with the smallest constant consistent with the n data. We find the maximum (over such regular f) of |f̂(w)− f(w)| for the best possible emulator f̂ ; we call this the “mini-minimax uncertainty” at w. In reality, f might not be Lipschitz or—if it is—it might not attain its Lipschitz constant on the data. Hence, the mini-minimax uncertainty at w could be much smaller than |f̂(w)− f(w)|. But if the mini-minimax uncertainty is large, then— even if f satisfies the optimistic regularity assumption—|f̂(w) − f(w)| could be large, no matter how cleverly we choose f̂ . For the Community Atmosphere Model, the maximum (over w) of the mini-minimax uncertainty based on a set of 1154 observations of f is no smaller than it would be for a single observation of f at the centroid of the 21-dimensional parameter space. We also find lower confidence bounds for quantiles of the mini-minimax uncertainty and its mean over the domain of f . For the Community Atmosphere Model, these lower confidence bounds are an appreciable fraction of the maximum. To know that the emulator estimates f accurately would require evidence that f is typically more regular than it is across the n sample values.
منابع مشابه
Multivariate emulation of computer simulators: model selection and diagnostics with application to a humanitarian relief model
We present a common framework for Bayesian emulation methodologies for multivariate output simulators, or computer models, that employ either parametric linear models or non-parametric Gaussian processes. Novel diagnostics suitable for multivariate covariance separable emulators are developed and techniques to improve the adequacy of an emulator are discussed and implemented. A variety of emula...
متن کاملA comparison of statistical emulation methodologies for multi‐wave calibration of environmental models
Expensive computer codes, particularly those used for simulating environmental or geological processes, such as climate models, require calibration (sometimes called tuning). When calibrating expensive simulators using uncertainty quantification methods, it is usually necessary to use a statistical model called an emulator in place of the computer code when running the calibration algorithm. Th...
متن کاملUsing Emulators to Estimate Uncertainty in Complex Models
The Managing Uncertainty in Complex Models project has been developing methods for estimating uncertainty in complex models using emulators. Emulators are statistical descriptions of our beliefs about the models (or simulators). They can also be thought of as interpolators of simulator outputs between previous runs. Because they are quick to run, emulators can be used to carry out calculations ...
متن کاملThe dangers of sparse sampling for the quantification of margin and uncertainty
Activities such as global sensitivity analysis, statistical effect screening, uncertainty propagation, or model calibration have become integral to the Verification and Validation (V&V) of numerical models and computer simulations. One of the goals of V&V is to assess prediction accuracy and uncertainty, which feeds directly into reliability analysis or the Quantification of Margin and Uncertai...
متن کاملA Spatial-Random-Process Based Multidisciplinary System Uncertainty Propagation Approach with Model Uncertainty
The performance of a multidisciplinary system is inevitably affected by various sources of uncertainties, usually categorized as aleatory (e.g. input variability) or epistemic (e.g. model uncertainty) uncertainty. In the framework of design under uncertainty, all sources of uncertainties should be aggregated to assess the uncertainty of system quantities of interest (QOIs). In a multidisciplina...
متن کامل